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Abstract 

Given a set A C TLjNTL we may form a Cayley sum graph Ga on vertex set TLjNTL by 
joining i to j if and only if i+j G A. We investigate the extent to which performing this 
construction with a random set A simulates the generation of a random graph, proving 
that the clique number of Ga is almost surely O(logA^). This shows that Cayley sum 
graphs can furnish good examples of Ramsey graphs. To prove this result we must 
study the specific structure of set addition on TLjNTL. Indeed, we also show that the 
clique number of a random Cayley sum graph on F = (Z/2Z)" is almost surely not 

o(iog|r|). 

1. Introduction. There are many papers in which a graph is constructed from a set in 
something like the following manner. 

Construction 1 Let T be a finite ahelian group with cardinality N and let A <^T . Define 
Ga to be the graph on vertex set V in which ij G E{Ga) precisely when i + j E A. 

For any positive integer m write = Z/mZ. In this paper we will be interested in the two 
particular groups F = Zjv and V = 'S^- 

Example. Suppose that is prime and let A be the set of quadratic residues modulo A^. 
The graph Ga is then called the Paley sum graph of order A^, and is denoted by Pjq- It is 
generally thought that the Paley graphs behave much like random graphs from the binomial 
model (7(7X^,1/2), though in many respects this statement is far from being proven. We 
remind the reader that in this model all (labelled) graphs on vertices occur with equal 
probability. Equivalently, a graph from (^(A^, 1/2) may be generated by picking each possible 
edge ij independently at random with probability 1/2. See for the basic definitions of 
random graph theory (and much more). 

In the light of the above example it seems natural to address the following question. 

Question 2 To what extent does choosing a random subset A C F and forming Ga simulate 
the selection of a random graph on N vertices ? 
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We call a graph generated using this model a random Cayley sum graph. The reader may 
have already noticed that there is one sense in which a random Cayley sum graph is definitely 
not like a random graph - it is always regular. However in a recent paper of Krivelevich, 
Sudakov, Vu and Wormald [12] a good quantity of evidence was collected to support the 
claim that a random regular graph of degree N/2 (say) behaves in many ways almost exactly 
like a random graph from the binomial model G{N, 1/2). 

In this paper we will investigate just one aspect of Questional the clique number of a random 
Cayley sum graph. There are clearly other ways in which we might compare a random Cayley 
sum graph to a random graph. An investigation of the second largest eigenvalue, for example, 
leads naturally to some classical questions on random trigonometric polynomials. We discuss 
this matter in a separate paper [Til] . 

This is really a paper in combinatorial number theory. To see why, let A C F and consider 
what it means for X C F to span a clique in Ga- We must have i + j & A whenever i,j are 
distinct elements of X. Now the set {i + j '■ i,j G X, i 7^ j} is called the restricted sumset 
of X with itself and is normally written X+X. Thus we see that X spans a clique in Ga if 
and only if X+X C A. 

Now if X is highly structured, for example if it is an arithmetic progression, then |X+X| is 
rather small compared to |X| and the probability that Ga contains the clique spanned by 
X will be quite large. This does not mean, however, that the clique number of a random 
Cayley graph is very large since there are rather few arithmetic progressions. 

Let k,m E N, and make the following definition. 

Definition 3 5'^(F) is the collection of sets X C F with \X\ = k and |X+X| = m. 

Observe that 5'™(F) is empty if m < k — 1. Now one can easily write down a formula for 
E(T, k), the expected number of /c-cliques in a random Cayley graph on F. Indeed one has 

E{r; k) = ^ P(X is a clique in Ga) 

XCV:\X\=k 

= J2 H nx+xcA) 

m>k-l X£SJp{r) 

m>fc-ixes™(r) 

= Yl i^r(r)i2-™. 

m>fe— 1 

Now suppose we could show, for some function F = Fr : N — N, that E{r, F{N)) = o(l). 
Writing u for the clique number and Qk for the number of cliques of size k we would then 
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have 

F{uj>F{N)) = F{nFiN)>l) < E{T,F{N)) = o(l) 

by Markov's inequality. That is, the chque number of a random Cayley graph on F is almost 
surely at most F{N). As a result of these observations our attention is drawn to the problem 
of finding effective upper bounds for |S'™(r)|. 

Further notation and background. Let |r| = and consider the sample space Qn 
consisting of all graphs on vertices. If E = En is an event in this sample space then we 
write 

• P[an(-^) for the probability that a random graph from G{N, 1/2) satisfies E; 

• 'P^^g{E) for the probability that a graph selected from Gn,in/2\ satisfies E, where 
QN,[N/2i puts the uniform distribution on all the [A^/2j -regular graphs on A^ labelled 
vertices; 

• Pr(-E) for the probability that if A C F is chosen randomly, then the Cayley sum graph 
Ga satisfies E. 

The following well-known result concerning the clique number of a random graph (see, for 
example, jB]) sets our later results in context. In this paper, all logs are to the base 2 unless 
otherwise specified. 

Theorem 4 For any fixed e > we have 

lim P{.aii((2-e)logA^ < uj < (2 + e)logA^) = 1. 

A''— >oo 

In ^2] the same result is proved for random regular graphs: 
Theorem 5 For any fixed e > we have 

lim Ptee((2-e)logA^ < oo < (2 + e)logA^) = 1. 

Questions equivalent to estimating the clique number of a random Cayley sum graph appear 
in various places in the literature^, such as [H 13 El HI Ulj- To the best of my knowledge the 
following result, which is essentially given in P], is the best known. 

Theorem 6 There is an absolute constant C such that 

lim Pz^ [uj < C(logAr)2) = 1. 

^The author would like to thank Benny Sudakov for drawing his attention to the two papers ^E]. 
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A similar result is mentioned in P^, where it is remarked that it would be interesting to try 
and improve on it, and also, without proof, in [T^ . 

Let us conclude this section by making a remark concerning our terminology. We use the 
term Cayley sum graph to make it clear that the objects we are considering are very slightly 
different from what are normally called Cayley graphs. In an abelian group F a (true) Cayley 
graph is formed as follows. Take a symmetric set A, that is to say a set in which a E A implies 
that —a G A. Form a graph on vertex set F by joining i to j if and only if i — j E A. The 
group F acts on the resulting graph in an obvious way, and for this reason true Cayley graphs 
are perhaps more natural than Cayley sum graphs. From the rather number-theoretical point 
of view of this paper, however, the treatment of Cayley sum graphs seems to be a little tidier. 
The methods of this paper apply to true Cayley graphs with minimal modifications. When 
F = Z2, of course, the two types of graph actually coincide. 

2. Statement of results. The main result of this paper is the following improvement of 
Theorem IHl 

Theorem 7 We have 

lim Fz^ {u < 160 log A^) = 1. 

An immediate consequence of Theorem [7| is the following result in Ramsey theory, for which 
we do not know an independent proof: 

Theorem 8 For all sufficiently large integers N there exists a set A C for which the 
Cayley sum graph Ga has no cliques or independent sets of size 160 log A^. 

The behaviour of random Cayley sum graphs on is different, reflecting the fact that this 
group has much richer subgroup structure. We will prove the following. 

Theorem 9 There are absolute constants Ci,C2> such that 

lim P^n (CilogA^loglogA^ < uj < C2 log A^ log log A^) = 1, 

where A^ = 2" = jZ^j. 

We make no effort to optimise the values of Ci and C2 in this result. 

3. Preiman isomorphism classes. A vital notion in this paper is that of a Freiman 
isomorphism. Let s be a positive integer, let A and B be subsets of (possibly different) abelian 
groups and let : A —> 5 be a map. Then we say that (f) is a Freiman s-homomorphism if 
whenever oi, . . . ,as,a[, . . . ,a'^ E A satisfy 

ai + a2 H \- Os = a[ + 02 + h (1) 
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we have 

0(ai) + 0(02) + ■■ - + 0(0,) = 0(a;) + 0(a'2) + --- + 0K). (2) 

If (j) has an inverse which is also a s-homomorphism then we say that it is a Freiman s- 
isomorphism, and write A=sB. This is the case precisely if holds for any a^, a[ satisfying 

This is a standard and very convenient terminology in additive number theory. One must, 
however, be slightly careful: we invite the reader to consider the obvious map cf) : {0, 1} Z2, 
which is a one to one Freiman 2-homomorphism but not a Freiman isomorphism. 

In this paper we will mostly, but not exclusively, be concerned with Freiman 2-homomorphisms. 
From now on we will refer to these simply as homomorphisms. 

In order to discuss the theory of Freiman homomorphisms as cleanly as possible we introduce 
some further definitions. These are very much in the spirit of ^1], and indeed Lemma fTTl 
below is a crude (though slightly more general) version of one of the results of that paper. 

Let F be a field, let /c be a positive integer and let W he a. vector space over F. Let Vk be a 
A;- dimensional vector space over F with basis {ei, . . . , e^}. For any sequence a = (ai, . . . , ak) 
of k elements of W we can define a linear map (pa '■ Vk ^ W hj 

If f G ker (j)^ then we say that v is satisfied by a. Let s > 2 be an integer. Call any vector of 
the form 

Cii + ■ ■ ■ + Cis ~ Cji — ■ ■ ■ — Cj^ , 

where the is and js need not be distinct, an s-relation. Write TZs for the set of all s-relations 
in Vk- If 5" C Vfc write Span(S') for the F-linear span of the elements of S. 

Now for any subset S and any subspace ?7 of a vector space one has S'nSpan(S'nf/) = S'flf/. 
Thus in particular 

IZs n Span {IZs n ker = Tis^ ker cp^. (3) 

That is to say, the set TZg fl ker (()„ of s-relations satisfied by a can be completely recovered 
from the subspace spanned by it. 

Lemma 10 Suppose that a = (aj)jL;^ and a' = (0^)^=1 are two sequences of distinct el- 
ements of W , and suppose that Span (7^^ fl ker 0o-) = Span (7^^ fl ker 0o-')- Then the sets 
A = {tti, . . . , Qk} and A' = {ai, . . . , a^} are s-isomorphic. 

Proof. By Q we have TZs fl ker = T?.^ n ker (f)^'- That is, the set of s-relations satisfied by 
a and a' are the same. An s-isomorphism between A and A' is therefore given by the map 
Ui I— s> a-. □ 
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Lemma 11 The number of s -isomorphism classes of subsets ofW of size k is at most k'^^^ . 

Proof. Given any set A = {ai, . . . , a^} <Z W we may form a sequence a = (ai, . . . , ak). By 
Lemma ITUl the number of s-isomorphism classes of sets A is at most the number of subspaces 
of Vk spanned by s-relations. There are at most fc^* of these s-relations, and any such subspace 
is spanned by some subset of at most k of them. The required bound follows. □ 

Now if W,W' are two vector spaces over F and ii A (1 W then the space B.oms{A,W') 
of Freiman s-homomorphisms (p : A ^ W is a vector space over F. We call rF{A) = 
dimHom2(A, F) — 1 the Freiman dimension of A. It is easy to see that dimHom2(A, W) = 
{rp{A) + 1)^1™^ . In a short while we will prove the main result of this section, Lemma IT^ 
Let us first prepare the ground with the following lemma from linear algebra. 

Lemma 12 Let F be a field, let A be a finite non-empty set, and let ^ be a subspace of^, 
the vector space of all maps from A to F. Write d = dim$. Then there exists a subset 
{ai, . . . , ad} C A with the following properties. 

(i) For any fi,...,fd&F there is a unique G $ with (f){ai) = fi {i = 1, . . . ,d). 

(ii) If a & A then there are Ai, . . . , G -F {depending only on a) such that (f){a) = Ai0(ai) + 
■ ■ ■ + Xd4>{0'd) for any G $. If the constant map, which sends everything in A to Ip, lies in 
$ then we may assume that Ai + ■ ■ ■ + A^ = 1. 

Proof. Let A = {ai, . . . , a^}. Choose as a basis for the set {ai, . . . , a^} of maps defined 
by the conditions ai{aj) = 6ij. Now we may find some k — d oi the which, when taken 
together with $, span the whole of ^. Relabeling if necessary we may assume that these 
elements are a^+i, • • • , c^k, so that 

^' = $©Span(arf+i,...,afc). (4) 

It follows that for any /i, . . . , G -F we may find G $ and /i^+i, . . . , /ifc G -F so that 

/itti H h fd<yd = 4> + /id+iftd+i H h fikOik, 

implying that 0(aj) = /j (z = 1, . . . , d). To see that (p is unique, suppose that 4>'{ai) = 4>"{ai) 
for some 0', 0" G $ and all i = 1, . . . ,d. Then 0' — 0" G Span{ad+i, ■ ■ ■ , tt/t) and thus 
0' — 0" = since the sum in (j3)) is direct. 

As for assertion (ii), observe that any x G A induces a linear functional t^. : ^ ^ F via 
the rule ta;(0) = 0(2^)- For a fixed a G A the d+1 linear functionals ta, ta^, . . . , ta^ are linearly 
dependent and therefore we can find /i, /ii, . . . , /i^ G -F, not all zero, so that 

/i0(a) + /ii0(ai) H h/id0(ad) = (5) 

for any G $. If fi were equal to zero then we could easily contradict (i) by setting /« = 1 
and fj = for j ^ i. Therefore this is not the case and we may divide (0) through by fi to 
get (ii) (with Aj = for i = 1, . . . ,d). If the constant map lies in $ then Q must hold 

for it, which implies that Ai + -- - + A(i = l. □ 
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Lemma 13 Let W and W he finite- dimensional vector spaces over a field F and let A C 
W he a finite non-empty set of Freiman dimension r = rp{A). Then there is a subset 
{ai, . . . , a^+i} C A with the following properties. 

(i) For any w[,...,w[.^i G W there is a unique (p G B.om2{A,W') with 4){ai) = w[ {i = 
1, . . . , r + 1). Moreover, if w'l, . . . , if^+i are linearly independent then cf) is an isomorphism. 

(ii) If a E A then there are Ai, . . . , Aj.+i G F {depending only on a) with Ai + ■ ■ ■ + A^+i = If 
and such that (f){a) = Ai0(ai) + ■ ■ ■ + Xr+i4>{ar+i) for any (p G Hom2(v4, W). 

Proof. Part (ii) and the first part of (i) are swift consequences of Lemma I12| (i). Although 
that lemma deals only with homomorphisms from A to F, the required extension follows 
easily using the isomorphism Hom2(A, F") = (Hom2(A, F))". 

To prove the second assertion of (i), suppose that suppose that G Hom2(A, W) is not 
a Freiman isomorphism. Then there are elements a, a', a", a'" G A such that (/)(a) + 0(a') = 
0(a") + 4>{a"') but a + a' 7^ a" + a'". By part (ii) there are Ui,u'i,u'- ,u'-' G F such that 
V'(q-) = ^1=1 Uitp{ai) for all ip G Hom2(v4, V), for any vector space V over F (and similarly 
for a', a", a'"). One has 

r+l 

ip{a) + ip{a') - ip{a") - 'ip{a"') = ^^{ui + u[ - u'l - ul')ip{ai) (6) 

i=l 

for all ip. Taking V = W and ip = id we see that not all of the Ui -\- u'^ — u'- — u'-' are zero. 
Therefore, setting V = W and ip = (p, one sees that (p{ai) , . . . , (p{ar+i) must be linearly 
dependent. □ 

Corollary 14 Let A . Then r = rplA) is the largest integer d such that A is isomorphic 
to a suhset of F^ which is not contained in an affine subspace. In particular if A is a set of 
integers then rQ{A) = d really is equal to dimension in the original sense of Freiman (Hj- 

Proof. Apply Lemma El (ii) with W = W and (p = id. It follows that there are 
{ai, 02, ... , ttr+i} such that A is a subset of {AiOi + ■ ■ ■ + Ar+ia^+i : Ai + ■ ■ ■ + A^+i = If}, an 
affine space over F of dimension r. As rplA) is clearly an isomorphism invariant, the same 
must be true of any set isomorphic to A and it therefore follows that d < r. To see that 
r < d, pick linearly independent vectors w[, . . . , ti'r+i some space W, and observe that A 
is isomorphic to some set containing all of the w'^ by Lemma IT!^ (i). □ 

4. Isomorphism classes with small doubling. As in the last section, let be a vector 
space over a field F. The main result of this section is a bound for the number of mutually 
non- isomorphic sets A (1 W of cardinality k with small (restricted) doubling property. The 
cardinality of A+A is an isomorphism invariant and we say that an isomorphism class of sets 
has small doubling property if m = is relatively small in comparison to \A\ for each 

set in that class. 

li B ^ W and if t is a non- negative integer than a t- extension of B is simply any set 5 U F 
where F = {xi, . . . , Xt} and the Xi are distinct elements of W \ B. 
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Proposition 15 Suppose that A C. W has sufficiently large cardinality k — \A\ and let 
m = \A+A\. Then there exist a* e A and Ao,Ai C A such that \Ao\ < Ak~'^/'^^m, \A\Ai\ < 
Ak-^/^m and a* + Ai C Aq+Aq. 

Proof. Let Q = [/c^/^J and q — k~^/^^. Draw a graph on vertex set A by joining x to y if 
X -\-y is Q-unpopular, that is if the number of pairs {w, z) & A^, w ^ z, with w + z — x + y 
is less than Q. The total number of edges in the graph is no more than Qm, and so some 
vertex a* has degree at most 2Qm/k. This means that with at most 2Qm/k exceptions the 
sums a* + a, a E A are Q-popular. Pick a set X C A by choosing each a E A to lie in 
X independently and at random with probability q. Let Z — {a E A\a* + a ^ X+X}. If 
s = a* + a is Q-popular then there are at least Q/3 disjoint pairs {b, c) e A^ with b + c — s 
and b c. For each the probabihty that both b and c lie in X is q'^, and so 

P(a e Z) < (1 - g2)0/3 < 

It follows that 

mz\ < ^ + e-<^'«/3A; < 3k-^/'m. 
k 

Clearly we have E\X\ — qk — k^^/^^ and so 

E(|X| +A;^^/i5|Z|) < 4A;-^/^^m. (7) 

Pick a specific X C A for which \X\ + k^^'^^\Z\ < Ak'^/^^m, and set Aq ^ X, Ai ^ A\Z. 
Then \Ao\ < Ak-^/^^m, \A\Ai\< Ak-'^l^m and, by construction, a* + Ai C Aq+Aq. □ 

Lemma 16 Fix a non-negative integer t and a set B C.W of cardinality \B\ — I. Then the 
number of mutually non-isomorphic sets A of cardinality \ A\ — l + t, such that there exists a 
subset Aq C. A satisfying Aq =3 B, is at most Z^* . 

Proof. Write B — {bi, . . . ,bi}. Given Aq C A with the required properties write ^0 = 
{ai, . . . ,ai} and A = Aq U {a;+i, . . . , a;+t} in a manner that respects the 3- isomorphism 
between and B; that is, for ii, ^2, is, i^, i^, ie & {1, ■ ■ ■ ,1} 

K + K + bis - K - K -K =0 

if and only if 

tt^i ~1~ ~t~ ^i's ^if3 0* 

The isomorphism class of A is determined by the 2-relations in VJ+t satisfied by A. With 
each possible relation we can associate its projection w E Vt onto the last t coordinates of 
Vi+f There are at most t^ non-zero projections w; fix one of them, say w — {1,0, ... ,0), and 
consider relations corresponding to this specific projection. If A satisfies such a relation, say 

C^ii + 0,1+1 = 0,12 + (8) 
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with ii, 22, "^3 £ {1, • • • , 0) then all other such relations are uniquely determined: they corre- 
spond to the equalities + a^g — = a^^ + a^-g — a^^ with ji, j2, js ^ {1, • • • > 0; hence to the 
equalities bi^ + hi^ — hi^ = bj^ + bj^ — bj^ with ji, j2, js G {1, . . . , I}. Since there are at most 
choices for the relation (jH)), there are at most choices for the whole system of relations 
corresponding to w which A satisfies. Moreover, it is not difficult to see that this conclusion 
is actually true for any possible w ^ Q. The result follows on observing that to determine 
the system of relations in VJ+f, satisfied by A, we must for each non-zero w EVt determine 
the system of relations corresponding to w. □ 

The next lemma is an easy exercise. 

Lemma 17 Suppose that X X' . Then X+X ^3 X'+X' and any B C X+X is 3- 
isomorphic to a subset of X'+X'. 

Proposition 18 Let k be a sufficiently large positive integer and let m < k^^^^^ . Then 
the number of isomorphism classes of A (1 with \A\ = k and \A-\-A\ < m is at most 
{em/kY exp (A;^^/^^) . 

Proof. By Proposition ITK| to any A with 1^41 = k and < rn there correspond 

a* e A and Aq, Ai C A such that |v4o| < Ak'^/^^m, \Ai\ > \A\ - Ak~'^/^m and a* + Ai C 
Aq+Aq. By Lemma ITTl the number of possible 6-isomorphism classes to which Aq can belong 
is M < exp let Xi, . . . ,Xm be representatives of these classes. By Lemma ITTl the 

set a* + Ai is 3-isomorphic to a non-empty subset of some Xj+Xj of cardinality at most 
k] consequently, the number of 3-isomorphism classes to which Ai can belong is at most 
Mk{em/k)^. By Lemma ITHt each of these 3-isomorphism classes gives rise, by t-extensions 
for any fixed t < Ak'^^^m, to at most exp(A;^^/'^°) isomorphism classes. Altogether, we get at 
most 

exp(fc30/3i) . . 4fc-4/5^ . exp(/t2^/30) ^ (em/k)'' expi^^^^^) 

possible isomorphism classes for A. □ 

5. Avoiding very small doubling. The bound of PropositionllSlbecomes rather ineffective 
when m < 7k, say. This section contains a trick which allows us to avoid ever considering 
sets with extremely small doubling (see, however, the section entitled Concluding Remarks). 
The result we will actually use. Corollary EDI is a consequence of the following more general 
proposition. 

Proposition 19 Let k be a sufficiently large positive integer and let A be a subset of an 
abelian group with \A\ = k. Let e > be sufficiently small. Then there is a set B (1 A with 

IB| < '^MlMvk 

e 

and \B+B\ > (1 - e)k. 
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Proof. Let p = 3e^^ ^y\og{Q / e) /2k. Choose a set -B C A by picking each element of k 
independently and at random with probability p. By some rather standard tail estimates 
(see, for example, jH Theorem A. 1.4]) we have 

F{\\B\-pk\ > epk/3) < 2e-2^'p'^/^ = e/3. (9) 

Now for any x G B+B write s{x) for the number of ordered pairs (6i, 62) ^ B'^ with bi 7^ 62 
and 61 + 62 = X. The sum M{B) = s(x)^ equals the number of quadruples (61, 62, 63, 64) G 
5'^ with 61 + 62 = 63 + 64 and 61 7^ 62, 63 7^ 64- We call these summing quadruples. 

Let us divide the summing quadruples (ai, 02, as, 04) G A'^ into two classes: those in which 
01,02,03,04 are all distinct and those in which either oi = 03,02 = 04 or oi = 04,02 = 03. 
Call this latter type degenerate; there are less than 2k'^ degenerate quadruples in A^, and at 
most k^ non-degenerate ones. Now for a given quadruple, one can calculate the probability 
that it is also contained in B^. This probability is if the quadruple is non-degenerate, and 
p^ otherwise. Thus 

EM{B) = E(^^s(x)2) < 2A;V + A;V. 
It follows immediately that 

P [MiB) < ^kVj-jy ^ > ,/3. (10) 
Comparing ^ and |T0|) . we see that there is a specific choice of B for which 

\\B\-pk\ < epk/3 and M{B) < f / . (11) 

1 6/0 



Now the Cauchy-Schwarz inequality gives 



x(^B+B 



\B+B\ 



\b\Wb\ 




■If 




\B+ 


B\ 





This, together with (jllj) and a small amount of computation, shows that |-B+i?| > {1 — t)k. 
Moreover we have, from (fTT|) . that 

\B\ < pkil + e/3) < ^-^^^Vk. 

This completes the proof. □ 

Corollary 20 Let k be a sufficiently large positive integer and let A be a subset of an abelian 
group with \ A\ = k. Then A contains a subset C, \C\ > k/8, with \C+C\ > 7\C\. 
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Proof. Apply Proposition 1191 with e = 1/9. This supphes us with a set B, \B\ < 100\/fc, 
such that > 8k /9. Now simply take C to be the union of B and some arbitrary 

elements of A so as to ensure that \C\ lies between k/8 and 8k/ 63. □ 

6. The rank of sets with small doubling. In this section we obtain results concerning 
the Freiman dimension of sets ACT with = and 1^4+^4 1 = m. It is in this section that 
the difference between different groups F comes to the fore. When F = Z^r we can relate A 
to a subset of the integers, but this is not possible for F = and, by necessity, we must find 
a different approach. 

Let A C Zjv have |y4| = A; and 1^4+^4 1 = m and consider the image A C Z of A under the 
least positive residue or "unfolding" map Zjv "-^ {1, . . . ,N}. This clearly has 1^41 = k and 
l^+AI < 2m. Thus we may apply the following result of Freiman [S, page 24]. 

Lemma 21 (Freiman) Let 5* C Z have cardinality k and Freiman dimension r = tqIS). 
Then we have the inequality 

\S+S\ > r . 

Lemma 22 Suppose that A C Z^r has \A\ = k and \A-\-A\ = m, and let A be the image of 
A under the unfolding map Za? "-^ Z. Then we have the hound r<[^{A) < Am/k. 

Proof. Observe that r = r{A) < k — 1, this being a general property shared by all sets of k 
integers. Applying Lemma 1^ we have 

, , / r + l\ 1 

2m > \A+A\ > r \ k > -kr. □ 



Proposition 23 We have the bounds 



ifm< and 



whatever the value of m. 



|5r(Z^)| < N'-^'-f (^yexp(A:^V^2) (^2) 



\S'^{Zn)\ < N'+^k^' (13) 



Proof. It suffices to count the number of lifts A of sets A G S'™(Ziv). Let Xi, . . . , Xm be a 
complete set of representatives of the isomorphism classes of subsets X of the integers with 
|X| = k and |X+X| < 2m. We always have M < k"^^ by Lemma ITT] but, when m < /c^-^/^°/2. 
Proposition ITHl allows us to do better and we have the bound M < (2em/fc)*^ exp(/c^^/^^). 
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Now each A is the image of some Xj under some isomorphism ip : Xj — {1, . . . , N}, and 
by Lemma ^1 (i) there are at most N'^'^^^^^'^^ such isomorphisms. The proposition follows 
immediately from these observations and Lemma l22l □ 

We now turn our attention to subsets of An important tool will be the following conse- 
quence of an inequality of Pliinnecke which was reproved by Ruzsa jTS| . If G is any abelian 
group and if X C G then we write kX for the fc-fold sum X + X + ■ ■ ■ + X and kX — IX for 
the set X + ■ ■ ■ + X — X — ■ ■ ■ — X, where there are k — 1 plus signs and / difference signs. 

Theorem 24 (Pliinnecke - Ruzsa) Let A <Z G be such that \A + A\ < C\A\. Then for 
any positive integers k and I one has 

\kA-lA\ < C^+^\A\. 

This allows us to get a bound on the Freiman dimension over Z2 of subsets of Z2 with small 
doubling. 

Lemma 25 Suppose that A C Z2 has cardinality k and \A+A\ = m. Then ri^{A) + 1 < 
4m log k/k. 

Proof. By Corollary we may assume (by subjecting A to a Freiman isomorphism if 
necessary) that A is not contained in any affine subspace of Z2 of dimension less than ri^^{A). 

Let -< be an arbitrary ordering on Z'2 and let h = [logg k\. Suppose that |y4| = and that 
l^+^l = m, so that |y4 + y4| < m + 1 (we are in characteristic 2!), and let X C A be a subset 
of A which is such that the sums xi + X2 + ■ ■ ■ + Xh {xi -<■■■-< Xh) are all distinct, and which 
is maximal with respect to this property. If |X| > h then we have, using Theorem 1^ 

< 
< 

< 

It follows immediately from this that 

|X| < 2mhk-^+^'^ < Am\ogk/k. (14) 

Now the maximality of X implies that if we set Y = XU {a} for any a E A\X then some two 
of the sums j/i H — ■ + yh, yi + ■ ■ ■ + yh must be equal, where yi ^ ■ ■ ■ ^ yn and -<■■■-< j/^. 
It is clear that precisely one of the ?/j's or y'^s must equal a, and so a G hX — {h — 1)X. In 
particular, A is contained in the affine subspace of Zg spanned by X. The required inequality 



X 
h 
\hX\ 
\hA\ 
m + 1 
k 
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follows. □ 

It is interesting to compare this with the estimate 

rz2{A) + l < hgk + (m/k)'^ + 2\og{m/k). 

This bound, which follows from the work of Ruzsa jT^I on Freiman's theorem in torsion 
groups, is stronger than Lemma 1^ when m/k -C (logfc)^/^. Ruzsa and I, in work in progress, 
can improve this bound to something more like log A; + 2(m//c)^. It would be interesting to 
know the true behaviour: perhaps r < log A; + Cm/k. 

Proposition 26 We have the bounds 

, , 4mlogfe / eTfl \ ^ / 01 /on\ 

l^rrai < exp(pi/32) (15) 



ifm< A;3i/30 

|5™(Z^)| < N^^k^'' (16) 
whetever the value of m. 

Proof. This is slightly more straightforward than Proposition OHl Let Xi, . . . , Xm be a 
complete set of representatives of the isomorphism classes of sets X C Z2 with |X| = A; and 
< m. Proposition ^1 gives bounds for M, and we know that any set A G S^iTH^ is 
the image of some Xi under an isomorphism t/; : Xj — > Lemma El (i) and Lemma OKI 
allow us to bound the number of such isomorphisms. □ 

7. Putting everything together. In this section we use Propositions EHl and ISEl to give 
upper bounds for the sums 

E i^r(r)i2-™ 

m>7fc 

when r = and Z2 . An application of Proposition will then allow us to prove Theorems 
13 and We begin with Theorem [7| Let k = [201ogXj. Using ^ and (0 we have 

[fe31/30/2j fc(fc-l)/2 

m>7k m=7k ^= |^;,3i/30/2j 

Lfc31/30/2j 

< J2 2™((^+^)'°s^+^'°s(2r)-i+o(i)) ^ J2 2'"(^-^+°«) 

m=7k m>[A:3i/30/2j 

Now log(2eC)/C — 1 < —0.2499 when C > 7, and so both of the sums appearing here are 
bounded above by N^"^ for N sufficiently large. 
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Now suppose that some randomly selected set A C Zjv contains X+X with |X| = k. Then, 
by Proposition IT^ A contains some set Y+Y where |y| > k/8 and > 7\Y\. However 

if k = [160 log A^] then the expected number of such sets Y that A contains is at most A^~^, 
and in particular the probability that it contains even one is o(l). It follows immediately 
that almost surely A does not contain any X+X with |X| = [160 log A^], and this is the 
claim of Theorem [7| □ 

Similar computations apply for the group Z2, but now of course we use formulae ()15|1 and 
(fT^ of Proposition 1^ instead. Let k = [11 log A^loglog A^J . One has 

[fc31/30/2j k{k-l)/2 

m>7k m=7k m= [^^1/30/2] 

Lfc31/30/2j 

m=7fc m>[fe3l/30/2j 

Now log(eC)/C — 1 < —0.39 when C > 7, and so both sums appearing here are bounded 
above by A^~^ for N sufficiently large. The rest of the argument proceeds exactly as before, 
and one gets Theorem IHl (with C2 = 308). □ 

8. A lower bound for Zg. In this section we prove the lower bound in Theorem IHl 
Throughout this section we write T = 71^ and N = \T\. 

Proposition 27 

lim Pr(c<; > ^ log log log A^) = 1. 

Proof. Let k = [logn + log log n — Ij and consider all /c-dimensional subspaces Hi, ... , Hm 
of r, where 

^ ^ (2"-l)(2"-2)...(2"-2^-^) ^ 
(2^' _ l)(2fc - 2) . . . (2*= - 2^^-!) - 

For a random set A C F let Xj denote the indicator function of the event Hi \ {0} C A, and 
let X = Xi H h Xa/ . We have 

EX > 2''''-^^-'^'. (17) 
We will also estimate Var(X), and to do this it is necessary to prove a lemma. 
Lemma 28 The number of pairs for which dim(i^i fl Hj) = l,0<l<k, isat most 

2n{2k-l) 
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Proof of Lemma. We will show that the quantity in question is exactly 

(2" - 1)(2" - 2) . . . (2" - 2^''-^-^) 
(2^ - 2^2 . . . (2^= - 2^-1)2(2^ - 1)(2' - 2) . . . (2^ - 2^-i) ' 

which is easily seen to be at most 2"(2fc-0. To make this count, first pick the intersection 
L — HiO Hj in any of 

(2" - 1)(2" - 2) ... (2" - 2'-i) 
(2^-l)(2'-2)...(2^-2'-i) 

ways. Then pick vectors Ui, . . . , Uk-i, f i, . . . , Vk-i so that Hi = Span(L, Ui, . . . , Uk-i) and Hj = 
Span(L, vi, . . . , Vk-i)- The subspace L together with all the points Wj, Vi spans a subspace of 
dimension 2k — Z, so the number of choices for the itj, Vi is 

(2" - 2')(2'' - 2'+^) ... (2" - 2^^-^-^). 

However, many of these choices will give the same subspaces Hi, Hj. If H^, Hj are fixed and 
have Hi D Hj = L then there are (2^^ — 2') . . . (2*^ — 2^^"^) ways of choosing wi, . . . , Uk-i so 
that Hi = Span(L, ui, . . . , Uk-i), and the same number of ways of choosing vi, . . . , Vk-i- This 
completes the proof of the lemma. □ 

Now the covariance Cov(Xj, Xj) is positive if and only if Hi \ {0} intersects Hj \ {0}, which 
requires dim(i7j Hj) > 1. Thus we have, using the lemma. 



Var(X) = ^Cov(Xi,X^) 

k 

< 2 5];2"(2fc-02-2'=+^ (^2^' -l) 
1=1 

k 

^ 2 22fc"-2''+^+2'-ni 



Now one can check that 2'^ — nl < 2 — n ior 1 < I < k, and so 

Var(X) < 8fc2(2'=-^)"-2'+\ 
Now Chebyshev's inequality tells us that 

m = 0) < ^ < 8A;22^-^ 

which is o(l). This means that with probability 1 — o(l) the set A contains some Hi \ {0}. 
Note, however, that Hi \ {0} is precisely Hi+Hi. □ 
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9. Concluding remarks. It would certainly be of interest to determine the clique number 
of a random Cayley sum graph on Z^r asymptotically (almost surely). There are several 
points at which are argument leading to Theorem [3 was suboptimal. One was that we 
rather shied away from the issue of counting sets with extremely small doubling property, 
resorting instead to the trick of Proposition El It is possible to say rather more about these 
using Freiman's deep theorem [Hj, which says that such sets are economically contained in a 
multidimensional arithmetic progression. It was also rather wasteful to treat subsets of Z^v 
by using the unfolding map Z^v "—>■ {1, . . . , A^} in such a naive way. When is prime, it is 
possible to prove the following result: 

Proposition 29 Suppose that N is prime and that A is a subset of Z^v with \A+A\ < 
(logiV)V32. Then there are A G Z^,/i G Z^v for which + ^ C {0, . . . , [N/2\}. 

By subjecting our set A to such a homothety one can ensure that the unfolding map induces 
a Freiman isomorphism, so that the lifting A has = m. Proposition EHl may be called 

a rectification principle in the spirit of jSj- 

Using these results, and tidying up the approximations used in, for example. Lemma l22| one 
can certainly reduce the number 160 drastically, and perhaps even to (3 + e). To do this 
would be difficult, and there seem to be some even more substantial obstacles to pushing the 
constant down to (2 + e), which seems likely to be the correct value. We may return to these 
issues in a future paper. 

Another issue deserving of remark arises from the result of Graham and Ringrose [H], which 
implies that there is an absolute constant c such that, for infinitely many primes N, the 
clique number uj{Pn) is at least c log log log log N. However, using the Borel-Cantelli lemma 
together with the bounds obtained in §7 one can show that almost surely any sequence {Gn)n 
of random Cayley sum graphs on Z^r, \Gn\ = N, will have lim sup j^^^lj{Gn)/ logN < 160. 
Thus Paley graphs are (in a sense) not as good Ramsey graphs as random Cayley sum graphs. 

Let us close by mentioning a conjecture from 

Conjecture 30 ( Alon) There exists an absolute constant b such that for every group F on 
n elements there is a Cayley graph on this group containing neither a complete subgraph nor 
an independent set on more than blogn vertices. 

The results of this paper confirm this conjecture for cyclic groups but one should not expect 
the full conjecture, if it is true, to be easy to prove. Quite apart from the fact that our 
methods seem to break down hopelessly for all but a few very special groups, the result of 
Proposition UTl shows that random methods alone will not suffice to attack Alon's conjecture. 

10. Acknowledgements. I would like to thank Imre Ruzsa and Seva Lev for their generous 
help in improving the exposition in this paper. 
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